Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Détection du fondamental de la parole en temps réel : application aux voix pathologiques

Identifieur interne : 001162 ( Main/Exploration ); précédent : 001161; suivant : 001163

Détection du fondamental de la parole en temps réel : application aux voix pathologiques

Auteurs : Fadoua Bahja [Maroc]

Source :

RBID : Hal:tel-00927147

Descripteurs français

Abstract

This thesis is part of researches aimed at determining the fundamental frequency of speech signals. The first contribution is related to the development of real time pitch detector algorithms, based on an implicit circular autocorrelation of the glottal excitation. Among all the pitch detection algorithms described in the literature, few of them are able to tackle correctly all the problems of pitch tracking. For this reason, we expanded our scope of investigation and proposed new algorithms based on wavelet transforms. To evaluate the performances of the proposed algorithms, we used two databases : Bagshaw and Keele. The results we obtained prove that our developed algorithms compare favourably with the best reference pitch detector algorithms described in the literature. The second contribution of this thesis concerns the implementation of a voice conversion system in order to enhance the pathological voice. In this case, we talk about a correction system. Our main contribution, concerning voice conversion, lies in the prediction of Fourier cepstral coefficients related to the excitation signal. This new kind of prediction allowed us to implement conversion systems whose results, either they are objective or subjective, validate the proposed approach.

Url:


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="fr">Détection du fondamental de la parole en temps réel : application aux voix pathologiques</title>
<author>
<name sortKey="Bahja, Fadoua" sort="Bahja, Fadoua" uniqKey="Bahja F" first="Fadoua" last="Bahja">Fadoua Bahja</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-176072" status="OLD">
<orgName>Laboratoire LRIT, CNRST URAC 29</orgName>
<desc>
<address>
<addrLine>Rabat, Morocco</addrLine>
<country key="MA"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-301054" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-301054" type="direct">
<org type="institution" xml:id="struct-301054" status="VALID">
<orgName>Université Mohammed 5 Agdal</orgName>
<desc>
<address>
<country key="MA"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Maroc</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:tel-00927147</idno>
<idno type="halId">tel-00927147</idno>
<idno type="halUri">https://tel.archives-ouvertes.fr/tel-00927147</idno>
<idno type="url">https://tel.archives-ouvertes.fr/tel-00927147</idno>
<date when="2013-06-15">2013-06-15</date>
<idno type="wicri:Area/Hal/Corpus">005A77</idno>
<idno type="wicri:Area/Hal/Curation">005A77</idno>
<idno type="wicri:Area/Hal/Checkpoint">001078</idno>
<idno type="wicri:explorRef" wicri:stream="Hal" wicri:step="Checkpoint">001078</idno>
<idno type="wicri:Area/Main/Merge">001173</idno>
<idno type="wicri:Area/Main/Curation">001162</idno>
<idno type="wicri:Area/Main/Exploration">001162</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="fr">Détection du fondamental de la parole en temps réel : application aux voix pathologiques</title>
<author>
<name sortKey="Bahja, Fadoua" sort="Bahja, Fadoua" uniqKey="Bahja F" first="Fadoua" last="Bahja">Fadoua Bahja</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-176072" status="OLD">
<orgName>Laboratoire LRIT, CNRST URAC 29</orgName>
<desc>
<address>
<addrLine>Rabat, Morocco</addrLine>
<country key="MA"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-301054" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-301054" type="direct">
<org type="institution" xml:id="struct-301054" status="VALID">
<orgName>Université Mohammed 5 Agdal</orgName>
<desc>
<address>
<country key="MA"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Maroc</country>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="mix" xml:lang="fr">
<term>Fréquence fondamentale</term>
<term>auto corrélation circulaire</term>
<term>classification de voisement</term>
<term>conversion de voix</term>
<term>correction de voix</term>
<term>excitation cepstrale</term>
<term>impulsion cepstrale</term>
<term>modèle de mélange Gaussien</term>
<term>période de pitch</term>
<term>quantification vectorielle</term>
<term>temps-réel</term>
<term>transformation en ondelettes</term>
<term>vote majoritaire</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">This thesis is part of researches aimed at determining the fundamental frequency of speech signals. The first contribution is related to the development of real time pitch detector algorithms, based on an implicit circular autocorrelation of the glottal excitation. Among all the pitch detection algorithms described in the literature, few of them are able to tackle correctly all the problems of pitch tracking. For this reason, we expanded our scope of investigation and proposed new algorithms based on wavelet transforms. To evaluate the performances of the proposed algorithms, we used two databases : Bagshaw and Keele. The results we obtained prove that our developed algorithms compare favourably with the best reference pitch detector algorithms described in the literature. The second contribution of this thesis concerns the implementation of a voice conversion system in order to enhance the pathological voice. In this case, we talk about a correction system. Our main contribution, concerning voice conversion, lies in the prediction of Fourier cepstral coefficients related to the excitation signal. This new kind of prediction allowed us to implement conversion systems whose results, either they are objective or subjective, validate the proposed approach.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Maroc</li>
</country>
</list>
<tree>
<country name="Maroc">
<noRegion>
<name sortKey="Bahja, Fadoua" sort="Bahja, Fadoua" uniqKey="Bahja F" first="Fadoua" last="Bahja">Fadoua Bahja</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001162 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001162 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Hal:tel-00927147
   |texte=   Détection du fondamental de la parole en temps réel : application aux voix pathologiques
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022